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Abstract. We propose a liistory-based approximation of the Possible Worlds Se- 
mantics CPWiS) for reasoning about knowledge and action. A respective plan- 
^~> ning system is implemented by a transformation of the problem domain to an 

Answer-Set Program. The novelty of our approach is elaboration tolerant support 
for postdiction under the condition that the plan existence problem is still solvable 
in NP, as compared to E2 for non-approximated PWS of Son and Baral 1 19|. 
We demonstrate our planner with standard problems and present its integration in 
a cognitive robotics framework for high-level control in a smart home. 



-^ 1 Introduction 

^ Dealing with incomplete knowledge in the presence of abnormalities, unobservable pro- 

O cesses, and other real world considerations is a crucial requirement for real-world plan- 

ning systems. Action-theoretic formalizations for handling incomplete knowledge can 
fSJ be traced back to the Possible Worlds Semantics CPWS) of Moore lfT4l . Naive for- 

^ malizations of the VWS result in search with complete knowledge in an exponential 

•^ number of possible worlds. The planning complexity for each of these worlds again 

^T^ ranges from polynomial to exponential time 1 1 1 (depending on different assumptions 

and restrictions). Baral et al. [2| show that in case of the action language Ak the plan- 
ning problem is IJ2 complete (under certain restrictions). This high complexity is a 
problem for the application of epistemic planning in real-world applications like cogni- 



2f( live robotics or smart environments, where real-time response is needed. One approach 

,__! to reduce complexity is the approximation of VWS. Son and Baral |19| developed 

the 0-approximation semantics for ^fewhich results in an NP-complete solution for 

the plan existence problem. However, the application of approximations does not sup- 

S^ port all kinds of epistemic reasoning, like ipostdiction - a useful inference pattern of 

knowledge acquisition, e.g., to perform failure diagnosis and abnormality detection. 
Abnormalities are related to the iqualification problem: it is not possible to model all 
conditions under which an action is successful. A partial solution to this is lexecution 
monitoring (e.g. |T7l), i.e. action success is observed by means of specific sensors. If 
expected effects are not achieved, one can ipostdict about an occurred abnormality. 
In Section[3]we present the core contribution of this paper: a 'history' based approxima- 
tion of the VWS — called ih-approximation (T-LVX) — which supports postdiction. 
Here, the notion of history is used in an epistemic sense of maintaining and refining 
knowledge about the past by postdiction and commonsense law of inertia. For instance, 
if an agent moves trough a door (say at t = 2) and later (say sit — A) comes to know 
that it is behind the door, then it can postdict that the door must have been open at 



t = 2. Solving the plan-existence problem with h- approximation is in NP and finding 
optimal plans is in Z\|'. Despite the low complexity of HVX compared to ylJjit is 
more expressive in the sense that it allows to make propositions about the past. Hence, 
the relation between T-LVX and Ak is not trivial and deserves a thorough investigation 
which is provided in Section HI We extend Ak and define a itemporal query semantics 
{Ak^^^) which allows to express knowledge about the past. This allows us to show that 
1-LVX is sound wrt. a temporal possible worlds formalization of action and knowledge. 
A planning system for T-LVX is developed via its interpretation as an Answer Set 
Program (ASP). The formalization supports both sequential and (with some restric- 
tions) concurrent planning, and conditional plans are generated with off-the-shelf ASP 
solvers. We provide a proof of concept in SectionIS] 

2 Related Work 

Approximations of the VWS have been proposed, primarily driven by the need to re- 
duce the complexity of planning with incomplete knowledge vis-a-vis the tradeoff with 
support for expressiveness and inference capabilities. For such approximations, we are 
interested in: (i) the extent to which postdiction is supported; (ii) whether they are 
iguaranteed to be epistemically accurate, (iii) their itolerance to problem elaboration 
(13 1 and (iv) their icomputational complexity. We identified that many approaches in- 
deed support postdiction, but only in an ad-hoc manner: Domain-dependent postdiction 
rules and knowledge-level effects of actions are implemented manually and depend on 
correctness of the manual encoding. For this reason, epistemic accuracy is not guar- 
anteed. Further, even if postdiction rules are implemented epistemically correct wrt. a 
certain problem, then correctness of these rules may not hold anymore if the problem 
is elaborated (see Example [T]): Hence, ad-hoc formalization of postdiction rules is not 
elaboration tolerant. 

Epistemic Action Formalisms. Scherl and Levesque |fT8l provide an epistemic ex- 
tension and a solution to the frame problem for the Situation Calculus (SC) , and Patkos 
and Plexousakis 1 15 1 provide an epistemic theory for the Event Calculus. Demolombe 
and Parra |4| provide an approximate version of the epistemic SC which involves sim- 
pler frame axioms than those in ifTSl . such that an implementation is tractable. However, 
postdiction is not possible with this approach. Thielscher |20 1 describes how knowledge 
is represented in the Fluent Calculus. The implementation in FLUX is not elaboration- 
tolerant as it requires manual encoding of knowledge-level effects of actions. Liu and 
Levesque fTTl use a progression operator to approximate VWS. The result is a tractable 
treatment of the projection problem, but again postdiction is not supported. The PKS 
planner lfT6l is able to deal with incomplete knowledge, but postdiction is only sup- 
ported in an ad-hoc manner Vlaeminck et al. |23| propose a first order logical frame- 
work to approximate VWS. The framework supports reasoning about the past, allows 
for elaboration tolerant postdiction reasoning, and the projection problem is solvable in 
polynomial time when using their approximation method. However, the authors do not 
provide a practical implementation and evaluation and they do not formally relate their 
approach to other epistemic action languages. To the best of our knowledge, besides 
there exists no approach which employs a postdiction mechanism that is based on 



' Throughout the paper we usually refer to the full VWS semantics of .4fe. Whenever referring 
to the 0-approximation semantics this is explicitly stated. 



explicit knowledge about the past|^ There exist several PDDL-based planners that deal 
with incomplete knowledge. These planners typically employ some form of VWS se- 
mantics and achieve high performance via practical optimizations such as BDDs f3l or 
heuristics that build on a relaxed version of the planning problem [8J. The way how 
states are modeled can also heavily affect performance, as shown by To ETll with the 
iminimal-DNF approach. With T-LVX, we propose another alternative state representa- 
tion which is based on explicit knowledge about the past. 

The y^-Family of Languages. There exist different epistemic extensions to the ac- 
tion language A. Our work is strongly influenced by these approaches lfT2l [191 l22ll : 
Lobo et al. lfT2ll use epistemic logic programming and formulate a VWS based epis- 
temic semantics. The original Ak semantics is based on VWS and (under some restric- 
tions) is sound and complete wrt. the approaches by Lobo et al. 1 12 1 and Scherl and 
Levesque |18|. Tu et al. |22| introduce A'f, and add Static Causal Laws (SCL) to the 
0-approximation semantics of Ak- They implement A'f. in form of the ASCP planning 
system which - like T-LVX - is based on the translation to ASP. The plan-existence 
problem for A'j. is still NP-complete [22 J. The authors demonstrate that SCL can be 
used for an ad-hoc implementation of postdiction. However, to show that an ad-hoc re- 
alisation of postdiction is not lelaboration tolerant we provide the following example: 

Example 1 A robot can drive into a room through a door d. It will be in the room 
if the door is open: causes( driven, in,{opena} ). An auxiliary fluent did.drivcd represents 
that the action has been executed: causes(drived,did.drived3): A manually encoded SCL 
if(opend,{did^drived,in}) postdicts that if the robot is in the destination room after driving 
the door must be open. The robot has a location sensor to determine whether it arrived: de- 
terniines( sense.in, in). Consider an empty initial state Stnit = 0, a door d = 1 and a sequence 
a — [drivei; sense.in]. Here Al. correctfy generates a state 5' D {openi} where the door is 
open if the robot is in the room. Now consider an elaboration of the problem with two doors 
(d G {1, 2} J and a sequence a — [drivei; drive2', sense.in]. By Definitions 4-8 and the closure 
operator CLo in [221, Al produces a state S D {openi, open2} where the agent knows that 
door 1 is open, even though it may actually be closed: this is not sound wrt. VV^S semantics. 

Another issue is iconcurrent acting and sensing. Son and Baral lfT9l (p. 39) describe 
a modified transition function for the 0-approximation to support this form of concur- 
rency: they model sensing as determining the value of a fluent after the physical effects 
are applied. However, this workaround does not support some trivial commonsense in- 
ference patterns: 

Example 2 Consider a variation of the Yale shooting scenario where an agent can sense whether 
the gun was loaded when pulling the trigger because she hears the bang. Without knowing 
whether the gun was initially loaded, the agent should be able to immediately infer whether or 
not the turkey is dead depending on the noise. This is not possible with the proposed workaround 
because it models sensing the value of a fluent after the action was executed and in all cases 
the gun is unloaded after shooting. T-LVX allows for such inference because here sensing yields 
knowledge about the value of a fluent at the time it is sensed. 



~ In the review of an earlier / preliminary version of this paper, we received a reviewer comment 
informing us about a similar approach by Gelfond and Lifschitz in a paper titled: "A Language 
to Reason Backwards in Presence of Incomplete Information". However, Gelfond confirmed 
that they did not write such a paper. We have also been unable to locate such an article via 
other sources. 



3 h-approximation and its Translation to ASP 

The formalization is based on a foundational theory F^apx and on a set of translation 
rules T that are applied to a planning domain V. V is modelled using a PDDL like syn- 
tax and consists of the language elements in (fTafTB as follows: Value propositions (VV) 
denote initial facts ( [Ta| ; Oneof constraints {OC) denote exclusive-or knowledge ( [Tb] ); 



Goal propositions (^) denote goala^ ( Ici; Knowledge propositions (KV) denote sens- 
ing ( Idi; Executability conditions (£XC) denote what an agent must know in order to 



execute an action ( le i; Effect propositions (EV) denote conditional action effects ([TB. 



(:init i'"'') (la) (oneof i°° ... Z°° ) (lb) (: goal type (and /f ... i^ ) ) (Ic) 

{:action a n A\ (:action a executable /i-^n (:action a leffect /iir\ 

: observe /) (and t;^ ■ ■ ■ ''n ^i when (and L^...i^) I ) 

Formally, a planning domain P is a tuple {X, A, Q) where: 

- X is a set of value propositions (jTal and oneof-constraints (ITbl 

- ^ is a set of actions. An action a is a tuple (f 7-'",/C'P",f A'C") consisting of a set of 
effect propositions EV^ |TB, a set of knowledge propositions fCV^ \id\ and an executability 
condition EXC ([Ie|. 

- tj is a set of goal propositions (jTcl. 

An ASP translation of V, denoted by LP(7'), consists of a domain-dependent theory 
and a domain-independent theory: 

- Domain-dependent theory (r^orid)'- It consists of a set of rules Pint representing initial 
knowledge; Fact representing actions; and Fgoais representing goals. 

- Domain-independent theory {FhapxY This consists of a set of rules to handle inertia {Fin); 
sensing (r"se„); concurrency {Fconc), plan verification iFverify) as well as plan-generation 
& optimization (Fpian)- 

The resulting Logic Program LP(7-') is given as: 

LPyP) — [ Fin U Fg^n U Fconc U Fyerify U Fpian ] U [ Fini U Fact U Fgoal] (2) 

Notation. We use the variable symbols A for action, EP for effect proposition, KP for Icnowledge 
proposition, T for time (or step), BR for branch, and F for fluent. L denotes fluent literals of the 
form F or ^F. L denotes the complement of L. For a predicate p (. . ., L, . . .) with a literal 
argument, we denote strong negation "— " with the ^ symbol as prefix to the fluent. For instance, 
we denote -knows (F, T, T, BR) by knows (^ F,T,T,BR).|l| is used to "positify" a literal, 
i.e. |^f| — F and |f| = F. Respective small letter symbols denote constants. For example 
knows (l,t,t' ,br) denotes that at step t' in branch br it is known that literal I holds at step t. 

3.1 Translation Rules: {V h-^ r-u,orid) 

The domain dependent theory Fi^orid is obtained by applying the set of translation rules 



T — {Tl ... T8 }• on a planning domain V. 

Actions / Fluents Declarations (ti). For every fluent / or action a, LP(7') contains: 

fluent{f). action{a). (Tl) 

T2-T3 

Knowledge (I i — > Fini). Facts Fini for initial knowledge are obtained by applying 
translation rules ( |T2[|T3[ ). For each value proposition ([Taji we generate the fact: 

know s{V"'\ 0,0,0). (T2) 



' type is either weak or strong. A weak goal must be achieved in only one branch of the 
conditional plan while a strong goal must be achieved in all branches (see e.g. L3J). 



For each oneof-constraint ( lb i with the set of Uterals C 
every Hteral I 



eC: 
knowsQr, 0, r, BR) ^ knows{l+ , 0, T, BR), ..., knows{l^ 
knows{l+ , 0, T, BR) <- knowsQr, 0, T, BR). 
knowsQf , 0, T, BR) -i- knows{lT , 0, T, BR). 



{11'^ . . . l'^}, we generate for 



,0,T,BR). (T3a) 
(T3b) 



where {l^ , . . . ,1^ } = C\Z°^. (T3ai denotes that if all literals except one are known 
not to hold, then the remaining one must hold. Rules ( T3b| i represent that if one literal 
is known to hold, then all others do not hold. At this stage of our work we only support 
static causal laws (SCL) to constrain the initial state, because this is the only state in 
which they do not interfere with the postdiction rules. 

T4-T7 

Actions {A i — > Pact)- The generation of rules representing actions covers exe- 
cutability conditions, knowledge-level effects, and knowledge propositions. 
Executability Conditions. These reflect what an agent must know to execute an action. 



Let EXC^ of the form (le i be the executability condition of action a in V. Then LP(7') 



contains the following constraints, where an atom occ{a,t,br) denotes the occurrence 
of action a at step t in branch br: 

<- occ{a, T, BR), not knows{lT ,T, T, BR). 

^ occ{a,T,BR),not kn(yws{f^ ,T,T,BR). 

For every effect proposition ep E EV"", of the form ( when ( and 



(T4) 



/^„) T) , LP(7') contains (T5i, where hasPC/2 (resp. hasNC/2) 



Effect Propositions. 

/c re re 

1 • • • J np ^ J np-\-\ • • • 

represents postive (resp. negative) condition literals, hasEf f /2 represents effect literals 
and hasEP/2 assigns an effect proposition to an action: 



hasEP{a,ep). hasEff{ep,r). 

hasPC{ep,fi). .. .hasPC{ep,f^p). . . .hasNC{ep, f^p+i). . 

Knowledge Level Effects of Non-Sensing Actions. ( T6a|T6c|r 



. hasNC{ep, f^ 



knows{l^,T + 1, Tl, BR) ^-apply{ep, T, BR), Tl > T, 

knows{ll, T, Tl, BR), ..., knows{ll, T, Tl, BR). 



knows{l^i,T,Tl,BR) 



-apply{ep,T,BR), 
knows{t,T + 1, Tl, BR), knows{F, T, Tl, BR). 



knows{ll' , T, Tl, BR) ^apply{ep, T, BR), knows{l^, T + 1, Tl, BR), 

knows{li + ,T, Tl, BR), ..., knows{l'i+ ,T, Tl, BR). 



(T5) 



(T6a) 



(T6b) 



(T6c) 



► Causal action effects (T6ai. If all condition literals 11 of an EP ( [T^ are known to 
hold at t, and if the action is applied at t, then at t' > i, it is known that its effects hold 
at i + 1. The atom apply (ep, t, br) represents that a with the EP ep happens at t in br. 

► Positive postdiction ( T6b[ ). For each condition literal If S {l^, . . . , I'j^} of an effect 
proposition ep we add a rule ( T6b i to the LP. This defines how knowledge about the 



condition of an effect proposition is postdicted by knowing that the effect holds after the 
action but did not hold before. For example, if at t' in br it is known that the complement 
I of an effect literal of an EP holds at some t < t' (i.e., knows (I, t, t' ,br) ), and if the 



The frame problem is handled by minimization in the stable model semantics (see e.g. 1101 ). 



EP is applied at t, and if it is known that the effect literal holds att + 1 (knows {l,t + 
l,t' ,br)), then the EP must have set the effect. Therefore one can conclude that the 
conditions {l^, . . . , ?£} of the EP must hold at t. 

► Negative postdiction (T6c i. For each potentially unknown condition literal /^^ € 
{11, ... , /Jj} of an effect proposition ep we add one rule (T6c i to the program, where 
{/^^, . . . , 11^} = {^J, . . . , I'nW^'i^ are the condition literals that are known to hold. 
This covers the case where we postdict that a condition must be false if the effect is 
known not to hold after the action and aU other conditions are known to hold. For 
example, if at t' it is known that the complement of an effect literal / holds at some t + 1 
with i + 1 < i', and if the EP is applied at t, and if it is known that all condition literals 
hold at t, except one literal l^^ for which it is unknown whether it holds. Then the 
complement of Z^^ must hold because otherwise the effect literal would hold at t + 1. 
Knowledge Propositions. We assign a KP ( [Td| to an action a using hasKP/2: 

^^ hasKP{aJ). ^ (T7) 

Goals (G I — > rgoai)- For literals l^^ ,..., l^^ in a strong goal proposition and l^^ , ..., l^ 
in a weak goal proposition we write: 

sGoal{T, BR) ^ knows{ll^,T, T, BR), ..., knows{l'J' ,T, T, BR), s{T),br{BR). (T8a) 
wGoal{T,BR) ^ knows{l'^<' ,T,T,BR), ...,knows{C^ ,T,T,BR),s{T),br{BR). (T8b) 

where an atom sGoal (t,6r) (resp. wGoal(i, &r)) represents that the strong (resp. 
weak) goal is achieved at t in br. 

3.2 Fhapx - Foundational Theory (F1-F5) 

The foundational domain-independent 'HPA'-theory is shown in Listing [T] It covers 
concurrency, inertia, sensing, goals, plan-generation and plan optimization. Line[T]sets 
the maximal plan length maxS and width maxBr. 

Fl. Concurrency (rconc) Line[3]applies all effect propositions of an action a if that 
action occurs. We need two restrictions regarding concurrency of non-sensing actions: 
effect similarity and effect contradiction. Two effect propositions are similar if they have 
the same effect literal. Two EPs are contradictory if they have complementary effect 
literals and if their conditions do not contradict (il. |4|. The cardinality constraint il. |5] 
enforces that two similar EPs (with the same effect literal) do not apply concurrently, 
whereas il. l6]restricts similarly for contradictory EPs. 

F2. Inertia (iln) Inertia is applied in both forward and backward direction simi- 
lar to |7|. To formalize this, we need a notion on knowing that a fluent is mot initi- 
ated (resp. terminated). This is expressed with the predicates kNotinit/kNotTerm|j 
A fluent could be known to be not initiated for two reasons: (1) if no effect proposi- 
tion with the respective effect fluent is applied, then this fluent can not be initiated. 
initApp (/, t, br) (il.l8]l represents that at t an EP with the effect fluent / is applied in 
branch br. If initApp if ,t,br) does not hold then / is known not to be initiated at t in 
br (il.|9|. (2) a fluent is known not to be initiated if an effect proposition with that fluent 
is applied, but one of its conditions is known not to hold (il. 10 1. Note that this requires 
the concurrency restriction (il.Bl. Having defined kNotinit/4 and kNotTerm/4 we 
can formulate forward inertia (il. 1 1 1 and backward inertia (il. 12 1. Two respective rules 
for inertia of false fluents are not listed for brevity. We formulate iforward propagation 
of knowledge in il. 13 That is, if at t' it is known that / was true at t, then this is also 
known at t' + 1. 



For brevity we omit the rules for kNotTerm resp. to ill. SllO 



Listing 1. Domain independent theory {Fhapx) 

s { . .maxS) . ss(0..maxS-l). br{0. .maxBr) . 

► Concurrency {Pconc) 

apply (EP, T, BR) :- hasEP(A,EP), OCC(A,T,BR). 
contra (EPl, EP) :- hasPC (EPl, F) , hasNC (EP, F) . 

:- 2{apply (EP,T,BR) :hasEff (EP,F) },br (BR) , s(T),f(F). 

:- apply (EP, T, BR) , hasEf f (EP,F) , apply (EPl, T, BR) ,hasEff (EPl, ^F) 

,EP != EPl, not contra (EPl, EP) . 

► Inertia (An) 

initApp (F, T,BR) : -apply (EP, T,BR) ,hasEff (EP,F) . 

kNotlnit (F,T,T1,BR) :- not initApp (F, T, BR) , uBr (Tl, BR) , s (T) , f (F) . 

kNotlnit (F,T,T1,BR) :- apply (EP, T, BR) , hasPC (EP, Fl ) , hasEf f (EP, F) 

, knows (^Fl, T, T1,BR) , T1>=T. 
knows (F, T+1, T1,BR) :- knows (F, T, Tl, BR) , kNotTerm(F, T, T1,BR) , T<T1, s (T) . 
knows (F, T-1, Tl, BR) :- knows (F , T, Tl , BR) , kNotlnit (F, T-1 , Tl , BR) , T>0, T1>=T, s(T). 
knows (L, T, T1+1,BR) : -knows (L, T, Tl.BR) ,Tl<maxS, S (Tl) . 

► Sensing and Branching {Fsen) 
uBr(0,0). uBr (T+1, BR) :- uBr(T,BR), s (T) . 
kw(F,T,Tl,BR) :- knows (F, T, Tl , BR) . 
kw(F,T,Tl,BR) :- knows (-iF, T, Tl, BR) . 
sOcc(T,BR) :- occ(A,T,BR), hasKP(A,_). 
leq(BR,BRl) :- BR <= BRl, br (BR) , br(BRl). 
l{nextBr (T, BR,BR1) : leq (BR, BRl ) } 1 :- sOcc(T,BR). 
:- 2{nextBr (T,BR,BR1) :br (BR) : s (T) ) , br (BRl ) . 
uBr(T+l,BR) :- sRes (^F, T, BR) . 

sRes(F,T,BR) :- occ (A, T, BR) , hasKP (A, F) , not knows (^F, T, T, BR) . 

sRes (^F, T,BR1) :- OCC (A, T, BR) , hasKP (A, F ) , not kw(F, T, T,BR) , nextBr (T, BR, BRl ) . 

knows (L,T, T+1, BR) :- sRes (L, T, BR) . 

knows (Fl, T, T1,BR1) :- sOcc (T1,BR) , nextBr (Tl, BR, BRl) , knows (F1,T,T1,BR), T1>=T. 

apply (EP, T, BRl) :-sOcc (T1,BR) , nextBr (Tl, BR, BRl) , uBr (T1,BR) , apply (EP, T,BR) , T1>=T. 

: -2 {occ (A, T,BR) ihasKP {A,_) } ,br (BR) , S (T) . 

► Plan verification {Fyerify) 

allWGsAchieved :- uBr (maxS, BR) , wGoal (maxS, BR) . 
notAllSGAchieved :- uBr (maxS, BR) , not sGoal (maxS, BR) . 
planFound :- allWGsAchieved, not notAllSGAchieved. 

:- not planFound. 

notGoal (T,BR) :- not wGoal(T,BR), uBr(T,BR). 

notGoal (T,BR) :- not sGoal(T,BR), uBr(T,BR). 

► Plan generation and optimization (Ppian) 

l{occ(A, T,BR) :a(A) )1:- uBr(T,BR), notGoal (T, BR) , br (BR) , ss (T) . % Sequential 
%1{0CC (A,T,BR) :a (A) } :- uBr(T,BR), notGoal (T, BR) , br (BR) , ss (T) . % Concurrent 
tminimize {occ(_,_,_) (3 1} % Optimal Planning 



F3. Sensing and Branching (i^sen) If sensing occurs, then each possible outcome 
of the sensing uses one branch. uBr (t, 6r ) denotes that branch br is used at step t. Pred- 
icate kw/4 in ill. T6p7 is an abbreviation for iknowing whether. We use sOcc {t,br) to 
state that a sensing action occurred at t in br (il. 18 1. By leq {br, br') the partial order 
of branches is precomputed (il.[T9|; it is used in the choice rule il.|20]to "pick" a valid 
child branch when sensing occurs. Two sensing actions are not allowed to pick the same 
child branch (il. 21 1. Lines 23p4 assign the positive sensing result to the current branch 
and the negative result to the child branch. Sensing results affect knowledge through 



il. 25 Line 26 represents inheritance: Knowledge and application of EPs is transferred 
from the original branch to the child branch (il.[27|. Finally, in line il.|28] we make the 
restriction that two sensing actions cannot occur concurrently. 

F4. 



Plan Verification {Fyenfy) Lines 30 



33 handle that weak goals must be achieved 
tranches. Information about nodes where 



in only one branch and strong goals in all 

goals are not yet achieved (ill. [34p5] l is used in the plan generation part for pruning 

and il. [38] implement se 



37 



F5. Plan Generation and Optimization (/],/„«) Line 

quential and concurrent planning respectively. Optimal plans in terms of the number of 

actions are generated with the optimization statement il.[39] 



3.3 Plan Extraction from Stable Models 

A conditional plan is determined by a set of occ/3, nextBr/3 and sRes/3 atoms. 
Definition 1 (Planning as ASP Solving) Let S be a stable model for the logic program LP{V), 
then p solves the planning problem V ifp is exactly the subset containing all occ/3, nextBr/3 
and sRes/3 atoms of S. 

For example, consider the atoms occ (ao,t, br), sRes (/, t, br) , sRes (^/, t, br') , 
next Br {t,br,br'), occ (ai,t + l,br) and occ (02,* + 1, br'). With a syntax as in ll22ll . 
this is equivalent to the conditional plan ao ;[ i f / then ai else 02]. 

3.4 Complexity of h-approximation 

According to |22|, we investigate the complexity for a limited number of sensing ac- 
tions, and feasible plans. That is, plans with a length that is polynomial wrt. some con- 
stant representing the size of the input problem. 

Theorem 1 ((Optimal) Plan Existence) The plan existence problem for the h-approximation 
is in NP and finding an optimal plan is in A2 ■ 
Proof Sketch: The result emerges directly from the complexity properties of ASP (e.g. (6)). 

1. The translation of an input problem via (ITTlTSj is polynomial. 

2. Grounding the normal logic program is polynomial because the arity of predicates is fixed. 

3. Determining whether there exists a stable model for a normal logic program is NP-complete. 

4. Finding an optimal stable model for a normal logic program is Zil' -complete. 

3.5 Translation Optimizations 

Although, optimisation of HVX is not in the focus at this stage of our we want to 
note two obvious aspects: (1) By avoiding unnecessary action execution, e.g. opening 
a door if it is already known to be open, search space is pruned significantly. (2) Some 
domain specificities (e.g., connectivity of rooms) are considered asemphstatic relations. 



For these, we modify translation rules (T4i (executability conditions) and (T2i (value 



propositions), such that knows/4 is replaced by holds/1. 

4 A Temporal Query Semantics for A-k 

T-iVX is not just an approximation to VWS as implemented in Ak- It is more expressive 
in the sense that l-LVX allows for propositions about the past, e.g. "at step 5 it is known 
that the door was open at step 3". To find a notion of soundness of T-LVX with Ak 
(and hence T'lViS-based approaches in general), we define a itemporal query semantics 
(Ak^^^) that allows for reasoning about the past. The syntactical mapping between Ak 
and T-LVX is presented in the following table: 





Ak 


■HVX PDDL dialect 






Value prop. 


initially(r'"*) 


(:lnit i'"") 


Effect prop. 


causes(a,r,{/f .../,1}) 


(: action a : effect when 


(and I'i . 


.11) l") 


Executability 


executable(a, {/r,---,C}) 


(: action a : executable 


and l^ . . 


.11^)) 


Sensing 


determines (a,{/,^/}) 


(: action a : observe /) 



An Ak domain description D can always be mapped to a corresponding T-LVX domain 
specification due to the syntactical similarity. Note that for brevity we do not consider 
executability conditions in this section. Their implementation and intention is very sim- 
ilar in h-approximation and Ak- Further we restrict the Ak semantics to allow to sense 
the value of only one single fluent with one action. 

Original Ak Semantics by Son and Baral [19|. Ak is based on a transition function 
which maps an action and a so-called c-state to a c-state. A c-state 5 is a tuple (m, S), 



where u is a state (a set effluents) and Z" is a k-state (a set of possible belief states). If a 

fluent is contained in a state, then its value is true, and false otherwise. Informally, u 

represents how the world is and S represents the agent's belief. In this work we assume 

grounded c-states for Ak, i-C- <^ — {u, S) is grounded if u g S. The transition function 

for non-sensing actions and without considering executability is: 

${a,l^u,E)) = (^Res{a,u),{Res{a,s)\s G X'}) where (3) 

i?es(a, s) = sU£+(s) \_B^(s) where (4) 

Ea{s) — {/I / is the effect literal of an EP and all condition literals hold in s.} 

Ea (s) = {^/l ^/ is the effect literal of an EP and all condition literals hold in s.} 

Res reflects that if all conditions of an effect proposition hold, then the effect holds in 

the result. The transition function for sensing actions is: 

<?(a, {u, E)) = (u, {si(s G r) A (/ G s ^ / G u)}) (5) 

For convenience we introduce the following notation for a k-state S: 

S'^ f iff Vs e Z" : / e s and Z" h -/ iff Vs S Z : / n s = (6) 

It reflects that a fliient is known to hold if it holds in all possible worlds s in Z. 

Temporal Query Semantics - Ak^'°^^ Our approach is based on a re-evaluation step 

with a similar intuition as the update operator "o" in f23!|: Let Zq = {sq, ■ • ■ , Sq " } 

be the set of all possible initial states of a (complete) initial c-state of an Ak domain 

D. Whenever sensing happens, the transition function will remove some states from 

the k-state, i.e. ^([ao; . . . ; a„], Jq) = ("««, ^n), where Z„ = {s° , . . . , sk "'} wifli 
1^0 1 ^ l-^nl- To reason about the past, we re-evaluate the transition. Here, we do 
not consider the complete initial state, but only the subset Zq of initial states which 
"survived" the transition of a sequence of actions. If a fluent holds in all states of a 
k-state Z", where Z" is the result of applying t <n actions on Zq , then after the n-th 
action, it is known that a fluent holds after the t-ih action. 

Definition 2 Let a — [ao; . . . ;a„] fee a sequence of actions and So be a possible ini- 
tial state, such that <f ([oq; . . . ; a„], (Sq) = 5^ ~ {u„, Sn). We define Eg as the set of 
initial belief states in Eq which are valid after applying a: Eq — {sojso G Eo A 
Res{a„, ResicLn-i, ■ ■ ■ , Res{ao, so) ■■■))& E„}r\We say that 

{I, t) is known to hold after a on 5o 
ifEt 1= / where {ut, X"") = <^([ao; . . . ; at], {uq, Eq)) and t <n 

Observation 1 If after n actions a fluent is known to be true at t after n actions, then it is still 
known to be true at t after n + l actions, because sensing always generates knowledge and hence 
reduces the set of belief states (E^^^ C E"). That is, the set of possible belief states for all steps 
t < n is shrinking monotonically with n: if E^ \— I then E^ |= /. 

Soundness wrt. Ak'^'^^- The following conjecture considers soundness for the pro- 
jection problem for a sequence of actions r] 

* Consider that according to Mb Res{a, s) = s if a is a sensing action. 

' A revised version of this paper may contain a corresponding theorem instead of a conjecture. 
A proof requires induction over n. The induction has to be generalized and classes of literals 
have to be considered. This is necessary to account for cyclic dependencies; these are generated 
e.g. by the interplay of postdiction and causality rule which trigger each other. 

Extending the soundness proof to cover conditional plans can be done via induction over 
the structure of plans, similar to the soundness proof for the cj-approximation in 1191 . This also 
requires a formal mapping from stable models to conditional plans as sketched in Subsection 
331 



Conjecture 1 Let D be a domain specification and a = [ai ; . . . ; a„] be a sequence of ac- 
tions. Let LP{D) — [Fin U -Tsen U Fconc U Fini U Fact] be a HPX-logic program without 
rules for plan generation (Fpian), plan verification (F^^rify) and goal specification (Fgoai). Let 
Pocc contain rules about action occurrence in valid branches, i.e. Face = {occ{ao, 0, BR) -^ 
uBr{Q, BR)., . . . , occ{an, n, BR) -f- uBr(n, BR).} Then for all fluents f and all steps t with 
< t < n, there exists a branch br such that: 

if knows (I, t, n, br ) G SM[LP{D) U /""^J then E" \= I with t < n. (7) 

where SM[LP{D) U F^cc] denotes the stable model of the logic program. 

The following observation is essential to formally investigate soundness: 
Observation 2 We investigate F^p^ (ListingUj and Florid orid observe that an atom 
knows (f,t,n,br) can only be produced by (a) Initial Knowledge \T2\ (b) Sensing (il. |25p 
(c) Inheritance (il. \26\ (d) Forward inertia (il. \ll\ (e) Backward inertia (il. \12\ (f) Forward prop- 
agation (il. \13\ (g) Causality {r6a\ (h) Positive postdiction \T6b\ or (i) Negative postdiction 



Soundness: To demonstrate soundness we would investigate each item i^m in Observation 
[2] and show that if knows (/, t, n, br) G SM[LP{D) U F"cc] produced by this item, then 
X'" j= / must hold for some br . However, for reasons of brevity we consider only some examples 
i(|b]|e][hl for positive literals / and without executability conditions: 

1. Sensing iJEl. The soundness proof for sensing is by induction over the number of sensing 
actions. For the base step we have that br = (il.|15[l. A case distinction for positive (/ £ it) 
and negative (/ ^ it) sensing results is required: With (ill. |23|24p the positive sensing re- 
sult is applied to the original branch br and the negative result is applied to a child branch 
determined by nextBr/3. The hypothesis holds wrt. one of these branches. The Ak re- 
striction that sensing and non-sensing actions are disjoint ensures that the sensed fluent did 
not change during the sensing. Hence, its value after sensing must be the same as at the time 
it was sensed. This coincides with our semantics where sensing returns the value of a fluent 
at the time it is sensed. 

2. Backward Inertia i(|el. Backward inertia (fl.[T2j generates knows ( f , t , n , br) with t < n 
if both of the following is true: 

A: knows {f,t+l,n,br) is an atom in the stable model. If this is true and we assume 

that the conjecture holds for i + 1, then X'^i j= /. 
B: kNotlnit if ,t,n,br) is an atom in the stable model. This again is only true if i(i) 
no action with an EP with the effect literal / is applied at t (ill. |8|9[ l or i(ii) an action 
with an EP with the effect literal / is applied at t, but this EP has at least one condition 
literal which is known not to hold (il.|IO[l. As of the result function pi this produces in 
both cases that Vsr G ^T ■ E+{s]^) = 0. 
With A: XT+i |= / and B: Vs? G ^? : ^<^(s?) = 0, we can teU by the transition function 
pb that 17" 1= / and the case of backward inertia is conditionally proven if the conjecture 
holds for knows (/, i + 1, n, br) . 

3. Positive Postdiction illhl. Positive postdiction l|T6bl generates an atom 
knows ifi ,t,n,br) if apply (ep, t, for) , knows {f^ ,t + l,n,br) and 
knows (f*^ ,t,n,br) with t < n and where /f is a condition literal and f^ is an 
effect literal of ep. We can show that positive postdiction generates correct results for the 
condition literals if ConjecturefTl holds for knowledge about the effect literal: That is, if we 
assume that i(i) S^^^ [= /" and i(ii) T" j= J^, then with the result function ^, i(i) and 
i(ii) can only be true if Ea{s") — f for all s" G E^. Considering the restriction that only 
one EP with a certain effect literal /^ may be applied at once (il.pll, E^ (s") = S^ can only 
hold if for all conditions /f : E^ |= /f . 

The case for causality, negative postdiction, forward inertia, etc. is similar. 




[Si] [S2] [S3] [S4] 

Fig. 1. The wheelchair operating in the Smart Home 

5 Evaluation and Case- Study 

In order to evaluate practicability of T-LVX we compare our implementation with the 
ASCP planner by Tu et al. |22| and show an integration of our planning system in a 
Smart Home assistance system. 

Comparison with ASCP. We implemented three well known benchmark problems 
for HVX and the 0-approximation based ASCP plannerfliBomb in the toilet (e.g. [SI; 
n potential bombs need to be disarmed in a toilet), iRings (e.g. |3|; in n ringlike con- 
nected rooms windows need to be closed/locked), and iSickness (e.g. 1221 : one of n 
diseases need identified with a paper color test). While T-LVX outperforms ASCP for 
the Rings problem (e.g. w 10s to 170s for 3 rooms), ASCP outperforms T-LVX for the 
other domains (e.g. w 280s to 140s for 8 bombs and « 160s to 1360s for 8 diseases). 
For the first problem, TiVX benefits from static relations and for the latter two prob- 
lems ASCP benefits from a simpler knowledge representation and the ability to sense 
the paper's color with a single action where TiVX needs n — 1 actions. 
Application in a Smart Home. The TiVX planning system has been integrated within 
a larger software framework for smart home control in the Bremen Ambient Assisted 
Living Lab (BAALL)l|9l. We present a use-case involving action planning in the pres- 
ence of abnormalities for an robotic wheelchair: The smart home has (automatic) sliding 
doors, and sometimes a box or a chair accidentally blocks the door such that it opens 
only half way. In this case, the planning framework should be able to postdict such an 
abnormality and to follow an alternative route. The scenario is illustrated in Fig.fT] 
Consider the situation where a person instructs a command to the wheelchair (e.g., to 
reach location: [^o]). An optimal plan to achieve this goal is to pass Dl. A more error 
tolerant plan is: Open Dl and verify if the action succeeded by sensing the door status 
[5*1]: ilf the door is open, drive through the door and approach the user lElse there is 
an abnormality: Open and pass D3 [5*2]; drive through the bedroom [S'3]: pass D4 and 
D2 [54]: and finally approach the sofa [S'slnlf it is behind the door then the door was 
open. For this particular use-case, a sub-problem follows: 



(:action open_door leffect when ^ab_open open) 
(:action drive iprecondition (and open ^in_liv) 
(: action sense_open : observe open) 
(:init ^in_liv ^open) (:goal weak in_liv) 



: effect in_liv) 



' We used an Intel i5 (2GHz, 6Gb RAM) machine running iclingo f6l with Windows 7. 
' Abnormalities are considered on the alternative route but skipped here for brevity. 
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Fig. 2. Abnormality detection as postdiction with h-approximation 

The solution to this subproblem is depicted in Fig.l2](see also state 5*1 in Fig. [Til. There is 
an autonomous robotic wheelchair outside the living room (^in_liv) and the weak goal 
is that the robot is inside the living room. The robot can open the door (open_door) to 
the living room. Unfortunately, opening the door does not always work, as the door may 
be jammed, i.e. there may be an abnormality. However, the robot can perform sensing 
to verify whether the door is open (sense_open). Figure l2] illustrates our postdiction 
mechanism. Initially (at i = and br — 0) it is known that the robot is in the corridor at 
step 0. The first action is opening the door, i.e. the stable model contains the atom 
occ (open_door, 0, 0) . Inertia holds for ^in_liv, because nothing happened that 
could have initiated ^in_liv. The rules in ill. 8]|9 trigger kNotinit (in_liv, 0, 0, 0) 
and il. 
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triggers knows (^in_liv, 0, 1, 0) , such that in turn the forward inertia 
rule (il. [TTi causes atom knows (^in_liv, 1,1,0) to hold. Next, sensing happens, 
i.e. occ (sense_open, 1,0). According to the rule in il.|23] the positive result is as- 
signed to the original branch and sRe s ( open ,1,0) is produced. According to the rule 



in il. 24 the negative sensing result at step t in branch hr is assigned to some child 
branch hr' (denoted by nextBr (t, br, hr') ) with br' > hr (il. 20 1. In the example we 
have: sRes (^open, 1,1), and due to il.|25]we have knows (^open, 1,2,1). This re- 



sult triggers postdiction rule ( T6c i and knowledge about an abnormality is produced: 
knows (ab_open, 0,2,1). Consequently, the wheelchair has to follow another route 
to achieve the goal. For branch 0, we have knows (open, 1,2,0) after the sensing. 
This result triggers the postdiction rule (T6bi: Because knows (^open, 0,2,0) and 
knows ( open , 1 , 2 , ) hold, one can postdict that there was no abnormality when open 
occurred: knows (^ab_open, 0,2,0). Finally, the robot can drive through the door: 
occ (drive, 2,0) and the causal effect rule ( |T6a[ ) triggers knowledge that the robot is 
in the living room at step 3: knows (in_liv, 3,3,0). 

6 Conclusion 

We developed an approximation of the possible worlds semantics with elaboration tol- 
erant support for postdiction, and implemented a planning system by a translation of 
the approximation to ASP. We show that the plan existence problem in our framework 
can be solved in NP. We relate our approach to the VWS semantics of Ak by extend- 
ing Ak semantics to allow for temporal queries. We show that HVX is sound wrt. this 
semantics. Finally, we provide a proof of concept for our approach with the case study 
in Section |5] An extended version of the Case Study will appear in fSl. Further testing 
revealed the inferiority of the T-LVX implementation to dedicated PDDL planners like 
CFF Is). This result demands future research concerning the transfer of heuristics used 
in PDDL-based planners to ASP. 
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